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TITLE OF THE INVENTION 

BLOCK-CONSTRAINED TCQ METHOD, AND METHOD AND APPARATUS FOR 
QUANTIZING LSF PARAMETER EMPLOYING THE SAME IN SPEECH CODING SYSTEM 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims priority from Korean Patent Application No. 2003-10484, filed 
February 19, 2003, in the Korean Industrial Property Office, the disclosure of which is 
incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[0002] The present invention relates to a speech coding system, and more particularly, to a 
method and apparatus for quantizing line spectral frequency (LSF) using block-constrained 
Trellis coded quantization (BC-TCQ). 

2. Description of the Related Art 

[0003] For high quality speech coding in a speech coding system, it is very important to 
efficiently quantize linear predictive coding (LPC) coefficients indicating the short interval 
correlation of a voice signal. In an LPC filter, an optimal LPC coefficient value is obtained such 
that after an input voice signal is divided into frame units, the energy of the prediction error for 
each frame is minimized. In the third generation partnership project (3GPP), the LPC filter of an 
adaptive multi-rate wideband (AMR_WB) speech coder standardized for International Mobile 
Telecommunications-2000 (IMT-2000) is a 16-dimensional all-pole filter and at this time, for 
quantization of 16 LPC coefficients being used, many bits are allocated. For example, the IS- 
96A Qualcomm code excited linear prediction (QCELP) coder, which is the speech coding 
method used in the CDMA mobile communications system, uses 25% of the total bits for LPC 
quantization, and Nokia's AMR_WB speech coder uses a maximum of 27.3% to a minimum of 
9.6% of the total bits in 9 different modes for LPC quantization. 

[0004] So far, many methods for efficiently quantizing LPC coefficients have been developed 
and are being used in voice compression apparatuses. Among these methods, direct 
quantization of LPC filter coefficients has the problems that the characteristic of a filter is too 
sensitive to quantization errors, and stability of the LPC filter after quantization is not 
guaranteed. Accordingly, LPC coefficients should be converted into other parameters having a 



Docket No.: 1793.1215 

good compression characteristic and then quantized and reflection coefficients or LSFs are 
used. Particularly, since an LSF value has a characteristic very closely related to the frequency 
characteristic of voice, most of the recently developed voice compression apparatuses employ a 
LSF quantization method. 

[0005] In addition, if inter-frame correlation of LSF coefficients is used, efficient quantization 
can be implemented. That is, without directly quantizing the LSF of a current frame, the LSF of 
the current frame is predicted from the LSF information of past frames and then the error 
between the LSF and its prediction frames is quantized. Since this LSF value has a close 
relation with the frequency characteristic of a voice signal, this can be predicted temporally and 
in addition, can obtain a considerable prediction gain. 

[0006] LSF prediction methods include using an auto-regressive (AR) filter and using a 
moving average (MA) filter. The AR filter method has good prediction performance, but has a 
drawback that at the decoder side, the impact of a coefficient transmission error can spread into 
subsequent frames. Although the MA filter method has prediction performance that is typically 
lower than that of the AR filter method, the MA filter has an advantage that the impact of a 
transmission error is constrained temporally. Accordingly, speech compression apparatuses 
such as AMR, AMR_WB, and selectable mode vocoder (SMV) apparatuses that are used in an 
environment where transmission errors frequently occur, such as wireless communications, use 
the MA filter method of predicting LSF. Also, prediction methods using correlation between 
neighbor LSF element values in a frame, in addition to LSF value prediction between frames, 
have been developed. Since the LSF values must always be sequentially ordered for a stable 
filter, if this method is employed additional quantization efficiency can be obtained. 

[0007] Quantization methods for LSF prediction error can be broken down into scalar 
quantization and vector quantization (VQ). At present, the vector quantization method is more 
widely used than the scalar quantization method because VQ requires fewer bits to achieve the 
same encoding performance. In the vector quantization method, quantization of entire vectors 
at one time is not feasible because the size of the VQ codebook table is too large and codebook 
searching takes too much time. To reduce the complexity, a method by which the entire vector 
is divided into several sub-vectors and each sub-vector is independently vector quantized has 
been developed and is referred to as a split vector quantization (SVQ) method. For example, if 
in 10-dimensional vector quantization using 20 bits, quantization is performed for the entire 
vector, the size of the vector codebook table becomes 10 x2 20 . However, if a split vector 



Docket No.: 1793.1215 

quantization method is used, by which the vector is divided into two 5-dimensional sub-vectors 
and 10 bits are allocated for each sub-vector, the size of the vector table becomes just 5 x 2 10 x 
2. 

[0008] FIG. 1 A shows an LSF quantizer used in an AMR wideband speech coder having a 
multi-stage split vector quantization (S-MSVQ) structure, and FIG. 1B shows an LSF quantizer 
used in an AMR narrowband speech coder having an SVQ structure. In LSF coefficient 
quantization with 46 bits allocated, compared to a full search vector quantizer, the LSF quantizer 
having an S-MSVQ structure as shown in FIG. 1A has a smaller memory and a smaller amount 
of codebook search computation, but due to complexity of memory and codebook search, 
requires a larger amount of computation. Also, in the SVQ method, if the vector is divided into 
more sub-vectors, the size of the vector table decreases and the memory can be saved and 
search time can decrease, but the performance is degraded because the correlation between 
vector values is not fully utilized. In an extreme case, if 10-dimensional vector quantization is 
divided into 10 1 -dimensional vectors, it becomes scalar quantization. If the SVQ method is 
used and without LSF prediction between 20 msec frames, LSF is directly quantized, and 
acceptable quantization performance can be obtained using 24 bits per vector. However, since 
in the SVQ method each sub-vector is independently quantized, correlation between sub- 
vectors cannot be fully utilized and the entire vector cannot be optimized. 

[0009] Many VQ methods have been developed including a method by which vector 
quantization is performed in a plurality of operations, a selective vector quantization method by 
which two tables are used for selective quantization, and a link split vector quantization method 
by which a table is selected by checking a boundary value of each sub-vector. These methods 
of LSF quantization can provide transparent sound quality, provided the encoding rate is large 
enough. 

SUMMARY OF THE INVENTION 

[0010] The present invention also provides an apparatus and method by which by applying 
the block-constrained Trellis coded quantization method, line spectral frequency coefficients are 
quantized. 

[0011] According to an aspect of the present invention, there is provided a block-constrained 
(BC)-Trellis coded quantization (TCQ) method including: in a Trellis structure having total N 
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(N=2 V , here v denotes the number of binary memory elements in the finite-state machine 
defining the convolutional encoder) states, constraining the number of initial states of Trellis 
paths available for selection, within 2 k (0 £ k £ v) in total N states, and constraining the number 
of the states of a last stage within 2 v " k among total N states according to the initial states of 
Trellis paths; after referring to initial states of N survivor paths determined under the initial state 
constraint by the constraining from a first stage to stage L-log 2 N (here, L denotes the number of 
the entire stages and N denotes the number of entire Trellis states), considering Trellis paths in 
which the state of a last stage is selected among 2 v " k states determined by each initial state 
under the constraint that the state of a last stage is constrained by the remaining v stages; and 
obtaining an optimum Trellis path among the considered Trellis paths and transmitting the 
optimum Trellis path. 

[0012] According to another aspect of the present invention, there is provided a line spectral 
frequency (LSF) coefficient quantization method in a speech coding system comprising: 
removing the direct current (DC) component in an input LSF coefficient vector; generating a first 
prediction error vector by performing inter-frame and intra-frame prediction of the LSF 
coefficient vector, in which the DC component is removed, quantizing the first prediction error 
vector by using BC-TCQ algorithm, and then, by performing intra-frame and inter-frame 
prediction compensation, generating a quantized first LSF coefficient vector; generating a 
second prediction error vector by performing intra-frame prediction of the LSF coefficient vector, 
in which the DC component is removed, quantizing the second prediction error vector by using 
the BC-TCQ algorithm, and then, by performing intra-frame prediction compensation, generating 
a quantized second LSF coefficient vector; and selectively outputting a vector having a shorter 
Euclidian distance to the input LSF coefficient vector between the generated quantized first and 
second LSF coefficient vectors. 

[0013] According to still another aspect of the present invention, there is provided an LSF 
coefficient quantization apparatus in a speech coding system comprising: a first subtracter 
which removes the DC component in an input LSF coefficient vector and provides the LSF 
coefficient vector, in which the DC component is removed; a memory-based Trellis coded 
quantization unit which generates a first prediction error vector by performing inter-frame and 
intra-frame prediction for the LSF coefficient vector provided by the first subtracter, in which the 
DC component is removed, quantizes the first prediction error vector by using the BC-TCQ 
algorithm, and then, by performing intra-frame and inter-frame prediction compensation, 
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generates a quantized first LSF coefficient vector; a non-memory Trellis coded quantization unit 
which generates a second prediction error vector by performing intra-frame prediction for the 
LSF coefficient vector, in which the DC component is removed, quantizes the second prediction 
error vector by using BC-TCQ algorithm, and then, by performing intra-frame prediction 
compensation, generates a quantized second LSF coefficient vector; and a switching unit which 
selectively outputs a vector having a shorter Euclidian distance to the input LSF coefficient 
vector between the quantized first and second LSF coefficient vectors provided by the memory- 
based Trellis coded quantization unit and the non-memory-based Trellis coded quantization unit, 
respectively. 

[0014] Additional aspects and/or advantages of the invention will be set forth in part in the 
description which follows, and, in part, will obvious from the description, or may be learned by 
practice of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] These and/or other aspects and advantages of the invention will become apparent 
and more readily appreciated from the following description of the embodiments, taken in 
conjunction with the accompanying drawings of which: 

[0016] FIGS. 1 A and 1 B are block diagrams of quantizers applied to adaptive multi rate 
(AMR) wideband and narrowband speech coders proposed by 3rd generation partnership 
project (3GPP); 

[0017] FIG. 2 is a diagram showing the Trellis coded quantization (TCQ) structure and 
output level; 

[0018] FIG. 3 is a diagram showing the structure of Trellis path information in TCQ; 

[0019] FIG. 4 is a diagram showing the structure of Trellis path information in TB-TCQ; 

[0020] FIGs. 5A-5D are diagrams showing a Trellis path that should be considered in a 
single Viterbi encoding process according to an initial state when a TB-TCQ algorithm is used in 
a 4-state Trellis structure; 
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[0021] FIG. 6 is a block diagram showing the structure of a line spectral frequency (LSF) 
coefficient quantization apparatus according to an embodiment of the present invention in a 
speech coding system; 

[0022] FIG. 7 is a diagram showing Trellis paths that should be considered in a single 
Viterbi encoding process according to a constrained initial state when a BC-TCQ algorithm is 
used in a 4-state Trellis structure; 

[0023] FIG. 8 is a schematic diagram of a Viterbi encoding process in a non-memory Trellis 
coded quantization unit in FIG. 6; 

[0024] FIG. 9 is a schematic diagram of a Viterbi encoding process in a memory-based 
Trellis coded quantization unit in FIG. 6; 

[0025] FIGS. 10A through 10C are flowcharts explaining the BC-TCQ encoding process of 
the non-memory Trellis coded quantization unit in FIG. 6; 

[0026] FIGS. 1 1A through 1 1C are flowcharts explaining the BC-TCQ encoding process of 
the memory-based Trellis coded quantization unit in FIG. 6; and 

[0027] FIG. 12 is a flowchart explaining an LSF coefficient quantization method according to 
the present invention in a speech coding system. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0028] Reference will now be made in detail to the present embodiments of the present 
invention, examples of which are illustrated in the accompanying drawings, wherein like 
reference numerals refer to the like elements throughout. The embodiments are described 
below in order to explain the present invention by referring to the figures. 

[0029] Prior to detailed explanation of the present invention, the Trellis coded quantization 
(TCQ) method will now be explained. 

[0030] While ordinary vector quantizers require a large memory space and a large amount of 
computation, the TCQ method is characterized in that it requires a smaller memory size and a 
smaller amount of computation. An important characteristic of the TCQ method is quantization 
of an object signal by using a structured codebook which is constructed based on a signal set 
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expansion concept By using Ungerboeck's set partition concept, a Trellis coding quantizer 
uses an extended set of quantization levels, and codes an object signal at a desired 
transmission bit rate. The Viterbi algorithm is used to encode an object signal. At a 
transmission rate of R bits per sample, an output level is selected among 2 R+1 levels when 
encoding each sample. 

[0031] FIG. 2 is a diagram showing an output signal and Trellis structure for an input signal 
having a uniform distribution when 2 bits are allocated for a sample. Eight output signals are 
distributed, in an interleaved manner, in the sub-codebooks of DO, D1, D2, and D3, as shown in 
FIG. 2. When quantization object vector x is given, output signal (x ) minimizing distortion 
( d(x, x) ) is determined by using the Viterbi algorithm, and the output signal (x ) determined by 
the Viterbi algorithm is expressed using 1 -bit/sample information to indicate a corresponding 
Trellis path and (R-1 )-bits/sample information to indicate a codeword determined in the sub- 
codebook allocated to the corresponding Trellis path. These information bits are transmitted 
through a channel to a decoder, and the decoding process from the transmitted bit information 
items will now be explained. The bit indicating Trellis path information is used as an input to a 
rate-1/2 convolutional encoder, and the corresponding output bits of the convolutional encoder 
specify the sub-codebook. Trellis path information requires one bit of path information in each 
stage and initial state information. The number of additional bits required to express initial state 
information is log 2 N when the Trellis has N states. 

[0032] FIG. 3 is a diagram showing the overhead information of TCQ for a 4-state Trellis 
structure. In order to transmit Trellis path (thick dotted lines) information determined by the TCQ 
method, initial state information '01' should be additionally transmitted in addition to L bits of 
path information to specify L stages. Accordingly, when data is being quantized in units of 
blocks by the TCQ method, the object signal should be coded by using the remaining available 
bits excluding log 2 N bits among entire transmission bits in each block, which is the cause of its 
performance degradation. In order to solve this problem, Nikneshan and Kandani suggested a 
tail-biting (TB)-TCQ algorithm. Their algorithm puts constraints on the selection of an initial 
trellis state and a last state in a Trellis path. 

[0033] FIG. 4 is a diagram showing a Trellis path (thick dotted lines) quantized and selected 
by TB-TCQ method suggested by Nikneshan and Kandani. Since transmission of path change 
information in the last log 2 N stage is not needed, Trellis path information can be transmitted by 
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using a total of L bits, and additional bits are not needed like the traditional TCQ. That is, the 
TB-TCQ algorithm suggested by Nikneshan and Kandani solves the overhead problem of the 
conventional TCQ. However, from a quantization complexity point of view, the single Viterbi 
encoding process needed by the TCQ should be performed as many times as the number of 
allowed initial Trellis states. The maximal complexity TB-TCQ method allows all initial states, 
each pair with a single (nominally the same) final state, and therefore the complexity is obtained 
by multiplying that of TCQ by the number of trellis states. For example, FIGs. 5A-5D are 
diagrams showing Trellis paths (thick solid lines) that can be selected in each of a total of four 
Viterbi encoding processes in order to find an optimal Trellis path by using TB-algorithm 
suggested by Nikneshan and Kandani. 

[0034] FIG. 6 is a block diagram showing the structure of a line spectral frequency (LSF) 
coefficient quantization apparatus according to an embodiment of the present invention in a 
speech coding system. The LSF coefficient quantization apparatus comprises a first subtracter 
610, a memory-based Trellis coded quantization unit 620, a non-memory Trellis coded 
quantization unit 630 connected in parallel with the memory-based coded quantization unit 620, 
and a switching unit 640. Here, the memory-based Trellis coded quantization unit 620 
comprises a first predictor 621, a second predictor 624, a second subtracter 622, a third 
subtracter 625, first through fourth adders 623, 627, 628, and 629, and a first block-constrained 
Trellis coded quantization unit (BC-TCQ) 626. The non-memory coded quantization unit 630 
comprises fifth through seventh adders 631 , 635, and 636, a fourth subtracter 633, a third 
predictor 633, and a second BC-TCQ 634. 

[0035] Referring to FIG. 6, the first subtracter 610 subtracts the DC component (f DC (n) ) of 
an input LSF coefficient vector (f(n) ) from the LSF coefficient vector and the LSF coefficient 

vector (x(n) ), in which the DC component is removed, is applied as input to the memory-based 
Trellis coded quantization unit 620 and the non-memory Trellis coded quantization unit 630 at 
the same time. 

[0036] The memory-based Trellis coded quantization unit 620 receives the LSF coefficient 
vector (x(n) ), in which the DC component is removed, generates prediction error vector (f. (n) ) 
by performing inter-frame prediction and intra-frame prediction, quantizes the prediction error 
vector ) by using the BC-TCQ algorithm to be explained later, and then, by performing 
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intra-frame and inter-frame prediction compensation, generates the quantized and prediction- 
compensated LSF coefficient vector (x(n) ), and provides the final quantized LSF coefficient 

vector (f (n) ), which is obtained by adding the quantized and prediction-compensated LSF 

coefficient vector (x(n) ) and the DC component (f (n) ) of the LSF coefficient vector, and is 

applied as input to the switching unit 640. 

[0037] For this, MA prediction, for example, a fourth-order MA prediction algorithm is applied 
to the first predictor 621 and the first predictor 621 generates a prediction value obtained from 
prediction error vectors of previous frames (n-i, here i = 1 ...4) which are quantized and intra- 
frame prediction-compensated. The second subtracter 622 obtains prediction error vector 
(e(n) ) of the current frame (n) by subtracting the prediction value provided by the first predictor 
621 from the LSF coefficient vector (x(n) ), in which the DC component is removed. 

[0038] To the second predictor 624, AR prediction, for example a first-order AR prediction 
algorithm is applied and the second predictor 624 generates a prediction value obtained by 
multiplying prediction factor ( p i ) for the i-th element by the ,(i-1 )-th element value ( | w («) ) which 
is quantized by the first BC-TCQ 626 and intra-frame prediction-compensated by the first adder 
623. The third subtracter 625 obtains the prediction error vector of i-th element value (/,(») ) by 
subtracting the prediction value provided by the second predictor 624 from the i-th element 
value (e^n) ) in prediction error vector (e(n) ) of the current frame (n) provided by the second 
subtracter 622. 

[0039] The first BC-TCQ 626 generates the quantized prediction error vector with i-th 
element value (*,(*)), by performing quantization of the prediction error vector with i-th element 
value {t.(n) ), which is provided by the second subtracter 625, by using the BC-TCQ algorithm. 
The second adder 627 adds the prediction value of the second predictor 624 to the quantized 
prediction error vector with i-th element value {i s (n) ) provided by the first BC-TCQ 626, and by 
doing so, performs intra-frame prediction compensation for the quantized prediction error vector 
with i-th element value (i^n) ) and generates the i-th element value (e^n) ) of the quantized 
inter-frame prediction error vector. The element value of each order forms the quantized 
prediction error vector ( e(ri) ) of the current frame. 



Docket No.: 1793.1215 

[0040] The third adder 628 generates the quantized LSF coefficient vector (x(n) ), by adding 
the prediction value of the first predictor 612 to the quantized inter-frame prediction error vector 
(e(n) ) of the current frame provided by the second adder 627, that is, by performing inter-frame 
prediction compensation for the quantized prediction error vector (e(n) ) of the current frame. 
The fourth adder 629 generates the quantized LSF coefficient vector {f x (ri) ), by adding DC 
component (/ DC {n) ) of the LSF coefficient vector to the quantized LSF coefficient vector 

(x(n) ) provided by the third adder 628. The finally quantized LSF coefficient vector (/ (w) ) is 
provided to one end of the switching unit 640. 

[0041] The non-memory Trellis coded quantization unit 630 receives the LSF coefficient 
vector ( x(n) ), in which the DC component is removed, performs intra-frame prediction, 
generates prediction error vector (t^n) ), quantizes the prediction error vector (t { (n) ) by using 

the BC-TCQ algorithm, which will be explained later, then performs intra-frame prediction 
compensation, and generates the quantized and prediction-compensated LSF coefficient vector 
(x(n) ). The non-memory Trellis coded quantization unit 630 provides the switching unit 640 

with the finally quantized LSF coefficient vector (f (n) ), which is obtained by adding quantized 
and prediction-compensated LSF coefficient vector (x(n) ) and DC component (f DC (n) ) of the 
LSF coefficient vector. 

[0042] For this, AR prediction, for example, a first-order AR prediction algorithm is used in 
the third predictor 632 and the third predictor 632 generates a prediction value obtained by 
multiplying prediction element (p. ) for the i-th element by the intra-frame prediction error vector 

with (M)-th element (x M (/i) ) which is quantized by the second BC-TCQ 634 and then intra- 
frame prediction-compensated by the fifth adder 631 . The fourth subtracter 633 generates the 
prediction error vector with i-th element {t^n) ) by subtracting the prediction value provided by 
the third predictor 632 from the i-th element (jc f (n) ) of the LSF coefficient vector (x(ri) ), in 
which the DC component is removed, provided by the first subtracter 610. 

[0043] The second BC-TCQ 634 generates the quantized prediction error vector of i-th 
element value (/.(«) ), by performing quantization of the prediction error vector of i-th element 
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(t.(n) ), which is provided by the fourth subtracter 633, by using the BC-TCQ algorithm. The 
sixth adder 635 adds the prediction value of the third predictor 632 to the quantized prediction 
error vector of i-th element value (£,(«) ) provided by the second BC-TCQ 634, and by doing so, 
performs intra-frame prediction compensation for the quantized prediction error vector of i-th 
element value (ig(n) ) and generates the quantized and prediction-compensated LSF coefficient 
vector of i-th element value ( x t (n) ). The LSF coefficient vector of the element values of each 
order forms the quantized prediction error vector (e(n) ) of the current frame. The seventh 
adder 636 generates the quantized LSF coefficient vector (/ (w) ), by adding the quantized LSF 
coefficient vector (x(n) ) provided by the sixth adder 635 to the DC component if DC (n) ) of the 

LSF coefficient vector. The finally quantized LSF coefficient vector (/ («) ) is provided to one 
end of the switching unit 640. 

[0044] Between LSF coefficient vectors (/ («) , f 2 (n) ) quantized in the memory-based 

Trellis coded quantization unit 620 and the non-memory Trellis coded quantization unit 630, 
respectively, the switching unit 640 selects one that has a shorter Euclidian distance from the 
input LSF coefficient vector (f(n) ), and outputs the selected LSF coefficient vector. 

[0045] In the present embodiment, the fourth adder 629 and the seventh adder 636 are 
disposed in the memory-based Trellis coded quantization unit 620 and the non-memory Trellis 
coded quantization unit 630, respectively. In another embodiment, the fourth adder 629 and the 
seventh adder 636 may be removed and instead, one adder is disposed at the output end of the 
switching unit 640 so that the DC component (/ (w) ) of the LSF coefficient vector can be 

added to the quantized LSF coefficient vector (x(n) ) which is selectively output from the 
switching unit 640. 

[0046] The BC-TCQ algorithm used in the present invention will now be explained. 

[0047] The BC-TCQ algorithm uses a rate- 1/2 convolutional encoder and N-state Trellis 
structure (N=2 V , here, v denotes the number of binary state variables in the encoder finite state 
machine) based on an encoder structure without feedback. As prerequisites for the BC-TCQ 
algorithm, the initial states of Trellis paths that can be selected are limited to 2 k (0 < k < v) 
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among the total of N states, and the number of states of the last stage are limited to 2 v k (0 < k < 
v) among a total of N states, and dependent on the initial states of the Trellis path. 

[0048] In the process for performing single Viterbi encoding by applying this BC-TCQ 
algorithm, the N survivor paths determined under the initial state constraint are found from the 
first stage to a stage L-log 2 N (here, L denotes the number of entire stages, and N denotes the 
number of entire Trellis states. Then, in the encoding over the remaining v stages, only Trellis 
paths are considered which terminate in a state of the last stage selected among 2 v " k (0 < k < v) 
states determined according to each initial state. Among the considered Trellis paths, an 
optimum Trellis path is selected and transmitted. 

[0049] FIG. 7 is a diagram showing Trellis paths that are considered when using the BC-TCQ 
algorithm with k being 1 and a Trellis structure with a total of 4 states. In this example, 
constraints are given such that the initial states of Trellis paths that can be selected are '00' and 
MO' among 4 states, and the state of the last stage is '00' or *01' when the initial state is '00' and 
'1 0* or '1 1 ' when the initial state is '1 0\ Referring to FIG. 7, since the initial state of survivor path 
(thick dotted lines) determined to state '00' in stage L-log 2 4 is '00', Trellis paths that can be 
selected in the remaining stages are marked by thick dotted lines with the states of the last 
stage being W and '01\ 

[0050] Next, the BC-TCQ encoding process performed in Trellis paths selected as shown in 
FIG. 7 in the memory-based Trellis coded quantization unit 620 will now be explained referring 
to FIG. 8 and FIGS. 1 0A through 10C. 

[0051] The Viterbi encoding process in the j-th stage in FIG. 8 or FIG. 10A will first be 
explained. Unlike x j in BC-TCQ encoding process in the non-memory Trellis coded quantization 
unit 630, the quantization object signals related to state p of the j-th stage are e'= x J -ju J -xj,' 1 
and e" = x j - /j j -xj«~ x , and vary depending on the state of the previous stage. This is shown in 
FIGS. 10A through 10C. In operation 101, initialization of the entire distance {p° p ) at state p in 

stage 0 is performed, and in operations 102 and 103, N survivor paths are determined from the 
first stage-to-stage L-log 2 N (here, L denotes the number of entire stages and N denotes the 
number of entire Trellis states). That is, in operation 102a, for N states from the first stage to 
stage L-log 2 N, quantization distortion (d Vp9 d r p ) for a quantization object signal obtained by 
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operation 102a-1 is obtained as the following equations 1 and 2 by using a corresponding sub- 
codebook, and stored in distance metric (d rp ,d rp ) in operation 102a-2: 

d i , p =mm(d(e\y r J\y i , p eD/, p ) (1) 

rf r>p = minCrfCe",^) | y rfP e (2) 

[0052] In equations 1 and 2, D J V denotes a sub-codebook allocated to a branch between 

state p in the j-th stage and state i' in the G-1)-th stage, and D{ np denotes a sub-codebook 
allocated to a branch between state p in the j-th stage and state i" in the (j-1 )-th stage. Here, y,. iP 
and y PiP denote code vectors in Df* p and Df„ p , respectively. 

[0053] Then, a process for selecting one between two Trellis paths connected to state p in 
the j-th stage and an accumulated distortion update process are performed as the following 
equation 3 (operation 102b-1 in operation 102b): 

pi=nM P r^d i ^plr^d r , p ) (3) 

[0054] Then, when state i' of the previous stage between the two paths is determined, the 
quantization value for x j at state p in j-th stage is obtained as the following equation 4 (operation 
102b-2 in operation 102b): 

x J p =e'+ju J -xf l (4) 

[0055] Next, in operation 104, in the remaining v stages, the only Trellis paths considered are 
those for which the state of the last stage is selected among 2 v k (0 £ k < v) states determined 
according to each initial state are considered. For this, in operation 104a, the initial state of 
each of N survivor paths determined as in the operation 103 and 2 v ' k (0 < k < v) Trellis paths in 
the last v stages are determined in operation 104a. 

[0056] In operations 1 04b through 1 04e, for each of 2 v_k (0 < k < v) states defined according 
to each initial state value in the entire N survivor paths, information on a Trellis path that has the 



-13- 



Docket No.: 1793.1215 

shortest distance between an input sequence and a quantized sequence in a path determined 
to the last state, and the codeword information are obtained. In the operations 104b through 
104e, pf n denotes the entire distance between an input sequence and a quantized sequence in 

a path determined to the last state (n=1, ... 2 vk ) in survivor path i, and dj n denotes the distance 

between the quantization value of input sample Xj and the input sample in a path determined to 
the last state (n=1, ... 2 v " k ) in survivor path i. 

[0057] Next, the BC-TCQ encoding process performed in Trellis paths selected as shown in 
FIG. 7 in the non-memory Trellis coded quantization unit 630 will now be explained referring to 
FIG. 9 and FIGS. 11A through 11C. 

[0058] Constraints on the initial state and last state are the same as in the BC-TCQ encoding 
process in the memory-based Trellis coded quantization unit 620, but inter-frame prediction of 
input samples is not used. 

[0059] First, the Viterbi encoding process in the j-th stage of FIG. 9 will now be explained, 
referring to FIGS. 11A through 11C. 

0 

[0060] In operation 111, initialization of the entire distance ( Pp ) at state p in stage 0 is 
performed, and in operations 112 and 113, N survivor paths are determined from the first stage- 
to-stage L-log 2 N (here, L denotes the number of entire stages and N denotes the number of 
entire Trellis states). That is, in operation 1 12a, for N states from the first stage to stage L- 
log 2 N, quantization distortion (d rp ,d rp ) is obtained as the equations 5 and 6 by using sub- 
codebooks allocated to two branches connected to state p in j-th stage, and stored in distance 
metric {d Vp ,d rp ): 



</,,,= min (d(x\ yi , p )\y rtP eD/, p ) (5) 

yi:p eD i; P 

d r , p = min (d(x<\y r , p )\y r , p eD/„J (6) 
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[0061] In equations 5 and 6, D( p denotes a sub-codebook allocated to a branch between 

state p in j-th stage and state i' in Q-1)-th stage, and Df„ p denotes a sub-codebook allocated to 
a branch between state p in j-th stage and state i" in G-1)-th stage. Here, y i tP and yr (P denote 
code vectors in D s v and Dj» p , respectively. 

[0062] Then, a process for selecting one among two Trellis paths connected to state p in j-th 
stage and an accumulated distortion update process are performed as equation 7 and 
according to the result, a path is selected and x j p is updated (operation 1 12b-1 and 112b-2 in 
operation 112b): 

p j p = minOs/" 1 + d Vp , pj: x + d rp ) (7) 

[0063] The sequence and functions of the next operation, operation 114, are the same as 
that of the operation 104 shown in FIG. 10C. 

[0064] Thus, unlike the TB-TCQ algorithm, the BC-TCQ algorithm according to the present 
invention enables quantization by a single Viterbi encoding process such that the additional 
complexity in the TB-TCQ algorithm can be avoided. 

[0065] FIG. 12 is a flowchart explaining an LSF coefficient quantization method according to 
the present invention in a speech coding system. The method comprises DC component 
removing operation 121, memory-based Trellis coded quantization operation 122, non-memory 
Trellis coded quantization operation 123, switching operation 124 and DC component 
restoration operation 125. Here, DC component restoration operation 125 can be implemented 
by including the operation into the memory-based Trellis coded quantization operation 122 and 
the non-memory Trellis coded quantization operation 123. 

[0066] Referring to FIG. 12, in operation 121, the DC component (f DC (n) ) of an input LSF 
coefficient vector (f(n) ) is subtracted from the LSF coefficient vector and the LSF coefficient 
vector (x(n) ) in which the DC component is removed is generated. 

[0067] In operation 122, the LSF coefficient vector (x(n) ), in which the DC component is 
removed in the operation 121, is received, and by performing inter-frame and intra-frame 
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predictions, prediction error vector (/,.(«) ) is generated. The prediction error vector (* f (w) ) is 
quantized by using the BC-TCQ algorithm, and then, by performing intra-frame and inter-frame 
prediction compensation, quantized LSF coefficient vector {x(n) ) is generated, and Euclidian 
distance {d memory ) between quantized LSF coefficient vector (x(n) ) and the LSF coefficient 
vector {x(n) ), in which the DC component is removed, is obtained. 

[0068] The operation 122 will now be explained in more detail. In operation 122a, MA 
prediction, for example, 4-dimensional MA inter-frame prediction, is applied to the LSF 
coefficient vector (x(n) ), in which the DC component is removed in operation 121, and 
prediction error vector (e(n) ) of the current frame (n) is obtained. Operation 122a can be 
expressed as the following equation 8: 

e(n) = x(n)-f d e(n-i) (8) 

1=1 

[0069] Here, e{n-i) denotes prediction error vector of the previous frame (n-i, here i=1,... 4) 
which is quantized using the BC-TCQ algorithm and then intra-frame prediction-compensated. 

[0070] In operation 122b, AR prediction, for example, 1 -dimensional AR intra-frame 
prediction, is applied to the i-th element value ) in the prediction error vector (e(n) ) of the 

current frame (n) obtained in operation 122a, and prediction error vector (t^n) ) of the i-th 
element value is obtained. The AR prediction can be expressed as the following equation 9: 

t i (n) = e i (n)-p r e i _ l (n) (9) 

[0071] Here, p { denotes the prediction factor of i-th element, and e M (n) denotes the (i-1)-th 

element value which is quantized using the BC-TCQ algorithm and then, intra-frame prediction- 
compensated. 

[0072] Next, the prediction error vector with i-th element value ) obtained by the 
equation 9 is quantized using the BC-TCQ algorithm and the quantized prediction error vector of 
i-th element value (£,(«) ) is obtained. Intra-frame prediction compensation is performed for the 
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quantized prediction error vector with i-th element value (/)(«) ) and the LSF coefficient vector 

with i-th element value (£, («) ) is obtained. LSF coefficient vector of the element value of each 

order forms quantized inter-frame prediction error vector (e(n) ) of the current frame. The intra- 
frame prediction compensation can be expressed as the following equation 10: 

e i (n) = t i (n) + p r e i _ l (n) (10) 

[0073] In operation 122c, inter-frame prediction compensation is performed for quantized 
inter-frame prediction error vector (e(n) ) of the current frame obtained in the operation 122b 
and quantized LSF coefficient vector (x(n) ) is obtained. The operation 122c can be expressed 
as the following equation 11: 

4 

x(n) = e(n) + J] e{n - i) (1 1) 

[0074] In operation 122d, Euclidian distance (d memory = d(x 9 x) ) between quantized LSF 

coefficient vector {x(n) ) obtained in operation 122c and the LSF coefficient vector (x(n) ) input 
in operation 122a, in which the DC component is removed, is obtained. 

[0075] In operation 123, the LSF coefficient vector (x(n) ), in which the DC component is 
removed in the operation 121, is received, and by performing intra-frame prediction, prediction 
error vector ) is generated. The prediction error vector (t.(n) ) is quantized by using the 
BC-TCQ algorithm and intra-frame prediction compensated, and by doing so, quantized LSF 
coefficient vector {x(n) ) is generated. Euclidian distance {d memoryless ) between quantized LSF 

coefficient vector (x(n) ) and the LSF coefficient vector (x(n) ), in which the DC component is 
removed, is obtained. 

[0076] Operation 123 will now be explained in more detail. In operation 123a, AR prediction, 
for example, 1 -dimensional AR intra-frame prediction, is applied to the LSF coefficient vector 
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(x(n) ), with i-th element {x^n) ), in which the DC component is removed in operation 121, and 
intra-frame prediction error vector with i-th element (/.(w) ) is obtained. The AR prediction can 
be expressed as the following equation 12: 

^W = ^(")-A'i.W (12) 

[0077] Here, p. denotes the prediction factor of the i-th element, and denotes intra- 

frame prediction error vector of the (i-1 )-th element which is quantized by BC-TCQ algorithm 
and then, intra-frame prediction-compensated. 

[0078] Next, the intra-frame prediction error vector with i-th element (t^n) ) obtained by 
equation 12 is quantized using the BC-TCQ algorithm and the quantized intra-frame prediction 
error vector with i-th element (i^n) ) is obtained. Intra-frame prediction compensation is 
performed for the quantized intra-frame prediction error vector with i-th element ) and the 
quantized LSF coefficient vector with i-th element value ( x^n) ) is obtained. The quantized LSF 
coefficient vector of the element value of each order forms the quantized LSF coefficient vector 
(x(n) ) of the current frame. The intra-frame prediction compensation can be expressed as the 
following equation 13: 

*>) = '>) + A'*mW (13) 

[0079] In operation 123b, Euclidian distance (d memory = d{x,x) ) between the quantized LSF 

coefficient vector {x(n) ) obtained in operation 123a and LSF coefficient vector (x(n) ) input in 
the operation 123a, in which the DC component is removed, is obtained. 

[0080] In operation 124, Euclidian distances (d memory1 d memoryless ), obtained in operations 122d 
and 123b, respectively, are compared and the quantized LSF coefficient vector (x(n) ) with the 
smaller Euclidian distance is selected. 
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[0081] In operation 125, the DC component (f DC (n) ) of the LSF coefficient vector is added 
to the quantized LSF coefficient vector {x(n) ) selected in the operation 124 and finally the 
quantized LSF coefficient vector (f(n) ) is obtained. 

[0082] Meanwhile, the present invention may be embodied in a code, which can be read by a 
computer, on a computer readable recording medium. The computer readable recording 
medium includes all kinds of recording apparatuses on which computer readable data are 
stored. 

[0083] The computer readable recording media includes storage media such as magnetic 
storage media (e.g., ROM's, floppy disks, hard disks, etc.), optically readable media (e.g., CD- 
ROMs, DVDs, etc.) and carrier waves (e.g., transmissions over the Internet). Also, the 
computer readable recording media can be scattered on computer systems connected through 
a network and can store and execute a computer readable code in a distributed mode. Also, 
function programs, codes and code segments for implementing the present invention can be 
easily inferred by programmers in the art of the present invention. 

<Experiment Examples> 

[0084] In order to compare performances of BC-TCQ algorithm proposed in the present 
invention and the TB-TCQ algorithm, quantization signal-to-noise ratio (SNR) performance for 
the memoryless Gaussian source (mean 0, dispersion 1) was evaluated. Table 1 shows SNR 
performance value comparison with respect to block length. Trellis structure with 16 states and 
a double output level was used in the performance comparison experiment and 2 bits were 
allocated for each sample. The reference TB-TCQ system allowed 16 initial trellis states, with a 
single (identical to the initial state) final state allowed for each initial state. 



Table 1 



Block length 


TB-TCQ(dB) 


BC-TCQ(dB) 


16 


10.53 


10.47 


32 


10.70 


10.68 


64 


10.74 


10.76 


128 


10.74 


10.82 
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[0085] Referring to table 1 , when block lengths of the source are 1 6 and 32, the TB-TCQ 
algorithm showed the better SNR performance, while when block lengths of the source are 64 
and 128, BC-TCQ algorithm showed the better performance. 

[0086] Table 2 shows complexity comparison between BC-TCQ algorithm proposed in the 
present invention and TB-TCQ algorithm, when the block length of the source is 16 as illustrated 
in table 1. 



Table 2 



Operation 


TB-TCQ 


BC-TCQ 


Remarks 


Addition 


5184 


696 


86.57% decrease 


Multiplication 


64 


64 




Comparison 


2302 


223 


90.32% decrease 



[0087] Referring to table 2, in addition and comparison operations, the complexity of the BC- 
TCQ algorithm according to the present invention greatly decreased compared to that of the TB- 
TCQ algorithm. 

[0088] Meanwhile, the number of initial states that can be held in a 16-state Trellis structure 
is 2 k (0 < k < v) and table 3 shows comparison of quantization performance for a memoryless 
Laplacian signal using BC-TCQ when k=0, 1, 4. The codebook used in the performance 
comparison experiment has 32 output levels and the encoding rate is 3 bits per sample. 



Table 3 



Order, k 


Block length, L 


L=8 


L=16 


L=32 


K=64 


k=0 


13.6287 


14.4819 


15.1030 


15.5636 


k=1 


14.7567 


15.2100 


15.5808 


15.8499 
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k=2 


14.9591 


15.4942 


15.7731 


15.9887 


k=3 


13.4285 


14.5864 


15.3346 


15.7704 


k=4 


11.6558 


13.2499 


14.4951 


15.2912 



[0089] Referring to table 3, it is shown that when k=2, the BC-TCQ algorithm has the best 
performance. When k=2, 4 states of a total 16 states were allowed as initial states in the BC- 
TCQ algorithm. Table 4 shows initial state and last state information of BC-TCQ algorithm when 
k=2. 



Table 4 



Initial states 


Last states 


0 


0,1,2,3 


4 


4, 5, 6, 7 


8 


8, 9, 10, 11 


12 


12, 13, 14, 15 



[0090] Next, in order to evaluate the performance of the present invention, voice samples for 
wideband speech provided by NTT were used. The total length of the voice samples is 13 
minutes, and the samples include male Korean, female Korean, male English and female 
English. In order to compare with the performance of the LSF quantizer S-MSVQ used in 3GPP 
AMR_WB speech coder, the same process as the AMFM/VB speech coder was applied to the 
preprocessing process before an LSF quantizer, and comparison of spectral distortion (SD) 
performances, the amounts of computation, and the required memory sizes are shown in tables 
5 and 6. 
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Table 5 





AMR WB S-MSVQ 


Present invention 


SD 


Average SD(dB) 


0.7933 


0.6979 


2-4 dB(%) 


0.4099 


0.1660 


> 4dB(%) 


0.0026 


0 



Tab 


e6 






AMR_WB 


Present 
invention 


Remarks 


Computation 


Addition 


15624 


3784 


76% decrease 


amount 


Multiplication 


8832 


2968 


66% decrease 




Comparison 


3570 


2335 


35% decrease 


Memory requirement 


5280 


1056 


80% decrease 



[0091] Referring to tables 5 and 6, in SD performance, the present invention showed a 
decrease of 0.0954 in average SD, and a decrease of 0.2439 in the number of outlier 
quantization areas between 2dB~4dB, compared to AMFM/VB S-MSVQ. Also, the present 
invention showed a great decrease in the amount of computation needed in addition, 
multiplication, and comparison that are required for codebook search, and accordingly, the 
memory requirement also decreased correspondingly. 

[0092] According to the present invention as described above, by quantizing the first 
prediction error vector obtained by inter-frame and intra-frame prediction using the input LSF 
coefficient vector, and the second prediction error vector obtained in intra-frame prediction, 
using the BC-TCQ algorithm, the memory size required for quantization and the amount of 
computation in the codebook search process can be greatly reduced. 

[0093] In addition, when data analyzed in units of frames is transmitted by using Trellis coded 
quantization algorithm, additional transmission bits for initial states are not needed and the 
complexity can be greatly reduced. 
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[0094] Further, by introducing a safety net, error propagation that may take place by using 
predictors is prevented such that outlier quantization areas are reduced, the entire amount of 
computation and memory requirement decrease and at the same time the SD performance 
improves. 

[0095] Although a few embodiments of the present invention have been shown and 
described, it will be appreciated by those skilled in the art that changes may be made in these 
elements without departing from the principles and spirit of the invention, the scope of which is 
defined in the appended claims and their equivalents. 
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